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The applicability of the Pontryagin maximum principle to signal-noise 
ratio maximization is explored. Attention is focused on the reformulation of 
the problem so that the maximum principle may be used. The basic aspect 
of the reformulation is to cast the problem into the form of differential equa- 
tions instead of integral equatio?is. 

Two problems are solved. The first, a variation of the matched filter 
problem, could have been solved by other methods. However, the maximum 
principle provided a very neat and systematic approach. The second problem, 
signal design with both an energy and an amplitude constraint imposed on 
the signal, is solved numerically. It appears to be intractable by other 
methods. One of the advantages of the maximum principle formulation is 
that, by working with differential equations rather than with integral equa- 
tions, numerical techniques may be more easily used. 

I. INTRODUCTION 

The Pontryagin maximum principle may be considered to be a gen- 
eralization of methods of calculus of variations that permits solution of 
optimization problems with inequality contraints. During the last few 
years, it has been extensively used to attack control theory problems. The 
use of the principle to solve signal optimization problems is introduced 
in Ref. 1. The maximum principle is briefly discussed in connection with 
wave-form optimization in Ref. 2. 

The purpose of this present paper is to develop techniques for the 
application of the maximum principle, in particular to problems of signal- 
noise ratio maximization. We shall show how the maximum principle 
may be used to solve some problems with inequality constraints (e.g., 
the amplitude of a signal may be constrained to be less than or equal to 
some maximum value) which were heretofore considered intractable. 
We shall also show how a problem, solvable by other methods, may be 
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very conveniently attacked with the formalism of the maximum princi- 
ple. 

It is interesting to note that the maximum principle is, in a sense, 
more applicable to communication theory than to control theory for 
which it was originally developed (this is also pointed out in Ref. 1). 
The maximum principle yields a function of time to maximize a func- 
tional subject to constraints and for prescribed initial conditions. The 
answer to most communication theory problems is a function of time. 
On the other hand, in control problems, the function of time for specific 
initial conditions is called an "open loop" solution. What is actually 
needed is the "closed loop" or "feedback" solution which is a function 
of the present state. This is only indirectly determined using the maxi- 
mum principle. 

After introducing the maximum principle, we shall first solve a con- 
strained matched filter problem and then a signal design problem. Atten- 
tion will be focused on the reformulation of the problems so that the 
maximum principle is applicable. 

II. THE MAXIMUM PRINCIPLE 

We shall briefly discuss the maximum principle. Our discussion is an 
abstraction of some material in Ref. 3. Another excellent introduction 
to the maximum principle, which is presently available, is the Introduc- 
tion and Chapter I of Ref. 4. Consider a system whose state is described 
by a vector x = (x\ , x% , • • • x„ ) which satisfies the differential equation 

*-/(aj,ii,0 tt[t ,U] (1) 

where u = (wj , • • • , u r ) is an r-dimensional control vector and f(x,u,t) = 
(Ji(x,u,t), f 2 (x,u,t), ••■ , f n (x,u,t)) is a given n-dimensional vector 
valued function of x, u, and t. We require that 

u(t) eQ te[to,k] (2) 

where the set ft is the set of admissible control vectors. Let F be the 
class of all piecewise continuous functions from [t , h] into ft. If u is 
a control function in the class F we denote the trajectory corresponding 
to u by x(t\u) which satisfies the following relations 

x(t; u ) = f(x(t;u), u(t), t) a.e. t e fe, , h] (3) 

x(t ; u ) = 0. (4) 

* Corinthian script (e.g., «,<J) is used to denote control functions. Small Eng- 
lish letters (e.g., u\t), v(t)) denote values of functions at specific times. The 
function « is the function whose value at time t is u{t). 
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(There is no loss of generality in assuming zero initial conditions; a 
transformation of coordinates may be used for nonzero initial condi- 
tions. ) 

The optimization problem is as follows. Let { Si , S2 , ■ • • , s m ] be a given 
set of real numbers where ^ m ^ n — 1 . We prescribe the final values 
(at t = t\ ) of the first m coordinates of the state vector x to be 

[Si , Si , • • • , Sm) 

and we require the final value of .r„ to be maximum. The optimization 
problem is formally stated as follows: we are given the set 

S = {x: Xi = Si for i = 1, • • • , m] (5) 

and we want to find a control function «? in the class F such that 
(») X(ti ',*) eS 

(ii) for all « e F such that (6) 

x(k; u )tS (7) 

the following relation holds: 

x n (k; u ) ^ x n (h;*). (8) 

The control function t is called the optimal control function and .r (£;*?) 
is the optimal trajectory. 

The Pontryagin maximum principle is a necessary condition that an 
optimal control function must satisfy. To state the principle, we first 
define the Hamiltonian, H(x,u,t,p), 

H(x,u,t,p) = (f(x,u,t)\p) (9) 

where p = {p x , ■ ■ ■ , p n ) is an n-dimensional vector and (a \ b) denotes 
the scalar product of a and b. 

2.1 Maximum Principle 

If ^ is an optimal control function then there exists a nonidentically zero 
continuous vector valued function p(t) such that 

(0 II(x(t;4), v(l), I, p(t)) ^ H(x(t;*), u, t, p(t)) 

for a.e t e [l , d] and all u e fl, (10) 
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(ii) p(t) = - 



'dfMt),t)Y mp(t) (11) 



for a.e. t e [k , h] 

(superscript T denotes transpose), 

(Hi) piih) = i = m + 1, • • • , n - 1, (12) 

(«0 p n (h) ^ 0. (13) 

Relation (i) states that the Hamiltonian, evaluated along the optimal 
trajectory, takes on its maximum value with v(t). Note that the 
maximization is over u, with x(t;i) and p(t) held fixed. 

Relation (ii) may alternatively be expressed as follows: 

m _ _ trS&KW)] . PlU ), i = l,t,...,n. (14) 

Relation (Hi) states that the final value of an element of the vector 

p(t) is zero if it corresponds to an element of the vector x(t) which is 

left free at t = h . 
Relation (iv) states that the nth element of p(t), which corresponds to 

the element of x (t) which is being maximized at t = t\ , is nonnegative 

at t = t\ . 

Verification that the maximum principle is satisfied is seen to be 
equivalent to verification that a set of differential equations with mixed 
boundary values is satisfied. Conditions on x(t\*) must be satisfied at 
both to and h and p(t) must satisfy conditions at t\ . This boundary 
value problem is not always solvable analytically but much progress 
has been made in the numerical solution of such problems. 

Communication theory problems are not usually stated in the form 
just described with differential equations. So the first order of business 
in applying the maximum principle to a communication theory problem 
is to convert it into the appropriate form with differential equations. 

III. A MATCHED FILTER PROBLEM 

To illustrate the formalism involved, we first solve a variation of the 
matched filter problem. The use of the maximum principle, in this case, 
is actually equivalent to using the classical calculus of variations. The 
basic matched filter problem is as follows (Ref. 5, p. 244). We have 
signal, Si(t), and noise, n{t), entering a linear filter and we wish to de- 
sign the linear filter so that the output signal-noise ratio is maximized 
at a specific time, t x . This problem can be trivially solved using the 
maximum principle and by other methods. To make the problem a 
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little more interesting, suppose that we also specify that the output to a 
second signal input, s 2 (t), is to be equal to some real number, a. For 
example, if we chose a = 0, we could be interested in detecting the 
presence of Si(t) while discriminating against s 2 (l). 
We assume white noise with correlation function 

R n (t - u) = N8(t - u). (15) 

Then the mean square noise at h , <r 2 (Ji), is (if we start the problem at 
t = and if we employ the usual formal operations with white noise) 

<r 2 (h) = N ( l h\r)dr (16) 

where h(t) is the impulse response of the linear filter. The outputs due to 
8i(t) and s 2 (0 at t = ti are, respectively, 

Vi(h) = / h(r) Siih - r) dr (17) 

yM = \ h(r) S 2 (ti - t) dr. (18) 

The problem is then to choose h(t) to maximize 

Mdf (1 

a 2 («i) U9) 

while satisfying the relationship 

V% (h) = a. (20) 

If we let 

Mt) = w(0*i(<i - 0. (21) 

MO = u(t)*(ti - 0, (22) 

Mt) = -Nu 2 (t), (23) 

^i(O) = .r 2 (0) = .r 3 (0) = 0, (24) 
and if we identify u(t) with h(t), then 

Mh) = mih), (25)* 

Mb) = Viih), (26) 

Mb) = -<r 2 (k). (27) 

* Note that for t ?* U , xi(t) does not necessarily equal yi(t). 
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An equivalent problem is to choose u(t) so as to maximize xa(h) sub- 
ject to 

Mh) = 1 (28) 

xzik) = a. (29) 

That is, we can minimize aih) with yi(ti) constrained since the signal- 
noise ratio is not changed if h(t) is multiplied by a constant factor. 

Now that we have recast the problem into differential equation form, 
we can solve it using the maximum principle. Using (14) we see that, 
since / is independent of x. 

pi(t) = constant = pi (30) 

pi(t) = constant = pi (31) 

Pz(t) = constant = 1 (32) 

(we let p 3 (t) = 1 for convenience). 
The Hamiltonian, H, is 

H = piu(t)si(ti - t) + pwiDstih - t) - Nu 2 (t). (33) 

Since there are no constraints on u(t), we maximize H by differentiating 
and get 

u(t) = ^ [pMk - t) + pMti - Dl (34) 

To satisfy (28) and (29), we must have 

^ [ ' [pMh - r) + P2S2U1 - t)1 Sl (h - t) At = 1 (35) 
2N •'0 

7 r T - T / [piSiih - t) + P2S2U1 - t)] s 2 (/i - t) dr = a. (36) 

We can then solve for pi and p 2 and get 

2N(S 2 - aSu) fw * 

Pi = m - sj (37) 

2N(aS l - Su) / q8 N 



* This involves an assumption of normality (in the sense of the classical calcu- 
lus of variations). 
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where 



rti 
Si = / 8i*(«i - r) dr 

S 2 = \ s'ity - r) dr 

Su = / si(/i — t) s 2 (ti — t) dr. 
Jo 



As a simple example, let 



8i(t) = 1 te[0,ti] 

H(t) = 1 te [0,^/2] 
t e (<,/2,«!]. 



(39) 
(40) 

(41) 

(42) 

(43) 



We would then get 



u(t) = h(t) = 2(1 a) Sl (/x - + ? (2a - 1) *(*, - 0. (44) 
h t\ 



IV. A SIGNAL DESIGN PROBLEM 

The following problem is taken from the thesis by M. I. Schwartz 
(Ref. 6). The system is depicted in Fig. 1. We have a signal passing 
through a linear time-invariant filter, represented by impulse response 
function h(t), after which the signal is corrupted by noise. The resultant 
signal plus noise is processed by a correlation -type receiver. The object 
is to maximize the signal-noise ratio at the output of the receiver at 
t = ti by choosing forms for both s(t) and receiver function q(t). M. I. 

CHANNEL 



SCt). 



h(t) 



x(t) r~\ y (t) 




n(t) 



L. 



y(t)qCt)dt 



Fig. 1 — Signal design problem. 



f 
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Schwartz solves this problem with an energy constraint on s(t). We will 
show how to simultaneously handle an energy constraint and an ampli- 
tude constraint on s(t). That is, we require that 

*' s 2 (r) dr = S (45) 

and 

\s(t)\ ^ S max . (46) 

To make the problem meaningful, we require that 

(Smax) h > £• 

It may be easily shown that the problem is equivalent to the problem 
with the equality of (45) replaced by ^. 
Again, we assume that the noise is white, i.e., 

R n (t - u) = N8(t - u), (47) 

and we further specify that h(t) has a rational Fourier transform with 
just poles* for simplicity. The assumption of rational Fourier transform 
facilitates recasting the problem into the differential equation form. Thus, 

fffott)] = 7^ (48) 

where a is a real number and D (ia) is a polynomial in z'w. Letting 
D(ia}) = (iw) 1 + ai(i'a))'" 1 + «h(iw) M + • • • + Oi-ifaO + «i i (49) 
(i.e., we have an Zth order differential equation in h(t)) and 

Xl = x (50) 

xt = x (51) 



Xi = x 



«-» (52) 



we can represent the effect of h(t) by the following set of first-order 
differential equations 



* If we also assumed zeros in the Fourier transform, we would solve for a de- 
rivative of h(t). 
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as 



(53) 



= 3,(0) = 0. 



We have converted an Zth order differential equation into / first-order 

differential equations. 

Let 



x* = 



Xi 

•To 



Xl 



(54) 



and let B be the Z X I matrix in (53). Then (53) may be more concisely 
written as 



x* = B x* + 







as 



(55) 



.r*(0) - 0. 

Now that we have put the effect of h (t) into differential equation form, 
it remains to cast the signal-noise considerations into differential equa- 
tions. Recall that the object is to maximize [so(ti)] 2 /<r 2 (ti) subject to 
(45) and (46) where 

so(ti) = J dtq(t) I dus{u)h(t-u) (56) 

Ah) = N f At) dt. (57) 



If we let 



zi(t) = s 2 (t) 



(58) 
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h(t) = Nq\t) 

k{t) = x x {t)q{t) (z 3 (h) = s (O) 

•4(0 = 2z 3 (t)x 1 (t)q(t) Mh) = s 2 (h)) 
21 (0) = 2 2 (0) = 2 3 (0) = 2 4 (0) = 
control vector = u{t) = (s(t),q(t)), 



(59) 

(60) 

(61)* 

(62) 



then the optimization problem is to choose s(t), subject to relation 
(46), and q(t) to maximize Ztih) (which equals s 2 (h)) subject to 
zi(h) = S and z 2 (fi) = a 2 . That is, we fix a 2 (t x ) at some arbitrary real 
number and maximize s 2 («i). Just as in the matched filter problem, 
multiplying q(t) by a constant does not affect the signal -noise ratio. 

Now our state vector is the (I + 4) -vector, (x*, Z\ , z 2 , Zz , z A ). Equa- 
tion (14) will take the following form (p(t) is an (I + 4)-vector): 



Pit) = ~ 



(B is the I X I matrix in (53 ) 
The final conditions on p (0 are 



r rt i 
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2z 3 q 
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(53)). 










(t) are 










Pi(h) 


= 








Pi(ti) 


= 









pit) 



(63) 



(64) 
(65) 



Viiti) = 0, 



(66) 



pi+iih), pt+iit*) unspecified (they correspond to z x (t) and z 2 (t) which 
are fixed at t — U.) 



* This differential equation is derived as follows: 

dt (2«(0) = 2z 3 (t)z 3 (l) = 2z 3 (t)x l (t)q(t) 

Z A {t\) = Zsfti) = So(/i). 
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Pi +3 (h) = (67) 

Pi+t(k) ^ 0. (68) 

An example of the use of the maximum principle for this signal design 
problem is given in the next section. 

V. EXAMPLE OF SIGNAL DESIGN 

We shall consider the case of 

8flK0]«T-4—. (69) 

ico -f- a 

Following the method described in Section IV, the problem is first re- 
cast into the following differential equation form 

x = — ax + as (70) 

ii = s 2 (zi(h) = S) (71) 

i 2 = Nq 2 ( 22 (h) = <r 2 ) (72) 

2a = xq (z 3 (h) = s (h)) (73) 

i t = 2z 3 xq (z 4 (ti) = So 2 («,)) (74) 

x(0) = Zl (0) = zo(0) = z 3 (0) = z 4 (0) = 0. (75) 

For this problem the necessary condition (maximum principle) for an 
optimal solution is that there exists a nonzero vector function p(t) = 
(pi(0i Pi(0i ' ' ' i P»(0) such that 

H = pi (-ax + as) + p-is 2 + pzNq 2 + p A xq + p b 2zixq (76) 

is maximized over the allowable s and q and such that 

p\ = api — qp A — 2z 3 q (77) 

Ps = (78) 

Pz = (79) 

p 4 = -2sg (80) 

p\ = (81) 

Pi«i) - (82) 

p 4 (/i) = (83) 

Pt(k) ^ (84) 
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(we can let p b (ti) = p 6 (t) = 1 under a normality assumption). 
The maximization of H leads to 

fl (0 = - [ P4( ° X(t) 2 ^J Z3(t) — ] (85) 

a(*) - ••(<) if | s*(0 I ^ &— (86) 

^ ;/i iS M x if | «*(0 !>&»„, (87) 



* 



«•(«) 



where 



■(f) = _Pl(0a 



2p 2 

To verify satisfaction of the maximum principle, it is necessary to 
solve the differential equations (70) to (74) and (77) to (81) with 
satisfaction of the above mentioned initial and final conditions and in 
such a way that maximization of the Hamiltonian is satisfied. 

The numerical method of satisfying the maximum principle is based 
on iteration of the initial values of p-vector to successively improve the 
final conditions. That is, we know the initial conditions of x, Z\ , z 2 , 
z 3 , Zi (see (75)) and we wish to constrain the final values of Z\ , z-i , 
p x , and pi : 

Zi(ti) = 8 (88) 

z,{k) = cr 2 (89) 

Pi(ti) = (90) 

p«fo) = 0. (91) 

Suppose we guess at p(0) = (pi(0), pt(0), Pa(0), P4(0), 1) and inte- 
grate the differential equations (70) to (74) and (77) and (81) and 
evaluate the following error in final conditions 

E = | Zi{k) - £ I + I *(h) - S I + I Pi(h) I + I P*(h) I (91 )t 
We wish to decrease E. To this end, let 

(Pl(0))„ew = (Pl(0))old + «pi(0) (92) 

and re-integrate the differential equations. If E decreases, change 

* Since j>t(t) + 2st(0 = 0, q(t) is proportional toa;(/)- Thus, we are equivalently 
maximizing the signal energy into the receiver (see (73)) and then correlating 
with x(l). This is consistent with (and, in fact, rederives) well-known properties 
of matched filters. 

t Actually, the second term is not required as a 2 can be adjusted by changing 
q(t) by a multiplicative factor (without changing the signal-noise ratio). 
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p 2 (0). If E does not increase, try 

(pi(0)„ew = (pi(0))oid - «pi(0). 

Again see whether E has decreased. If it has, change p x (0) to (pi (0) ) new 
and change 7*2(0). If £ has not decreased, retain (pi(0)) o id and try 
changing p 2 (0). 

Thus, the method is to successively change pi(0), P2(0), P3(0), p4(0) 
to decrease E. When E becomes sufficiently small, the maximum prin- 
ciple may be said to be satisfied. After we present some results, we will 
discuss the method further. 

5.1 Results 

Two cases were run. They were for the following parameters: 

k = 1 

8 = 1 

N = 1 

a = 1. 

(As mentioned previously, o- 2 is determined by the scaling of q(l).) The 
difference between the two cases is that in the first, the amplitude con- 
straint was not imposed and in the second, S mBX was set at 1.1. The first 
case was already treated by other methods in Ref . 6. Our results for that 
case were in agreement with those of Ref. 6. Fig. 2 shows q(t) for both 
cases and Fig. 3 shows s(t). For the case of no amplitude constraint the 
signal-noise ratio (Vs 2 (h)/o- 2 ) was 0.44 and the signal-noise ratio for 
the amplitude constrained case was 0.43. One could (nonoptimally) im- 
pose the amplitude constraint by scaling down the results for the ampli- 
tude unconstrained case (and not use all the signal energy available). 
That is, the peak amplitude of the signal in the first case is 1.27. The 
entire signal (s(t) for t e [0,ti]) could be reduced by a factor of 1.1/1.27. 
The signal-noise ratio would also be reduced by that factor (0.865). 
Whether or not this signal-noise ratio reduction is significant is not 
actually germane to this investigation. What is of consequence is the 
fact that the optimum can be determined and any sub-optimum scheme 
can be compared with it. 

5.2 Comments on the Numerical Method 

The basic method is similar to that of Ref. 7. In Ref. 7, the gradient 
(relating changes in the error to changes in p(t )) is evaluated and used 
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Fig. 2 — Optimum correlation waveforms. 

to determine a steepest descent change in p(t ). This was not done in 
the present problem for two reasons. First of all, the evaluation of the 
gradient (as done in Ref. 7) is not valid for the problem with inequality- 
constraints. Secondly, even if the gradient can be conveniently evalu- 
ated, it still requires extra integrations and the problem of step size is 
left unresolved. (This is not intended as criticism of the method of Ref. 
7 which may be quite useful in many applications.) We decided to 
frankly treat the problem as a systematic trial and error. Our method of 
seeing how the error changes as Pi(t ) is changed to Pi(t ) + 5p«('o) 
may be loosely interpreted as evaluating dE/dpi(t )-8pi(t ). 

The numerical method may be considered to be semiautomatic. There 
is little a priori information available as to the initial choices of the 
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Fig. 3 — Optimum signal waveforms. 



Pi (U) and the 8pi(t ). A few runs on the computer offer the optimizer 
some insight as to appropriate choices. About 30 iterations were found 
to be needed for convergence (approximately 0.01 hours of computer 
time). No convergence proof is offered for this method. In fact, even 
though it did not happen in the problem considered in this paper, it is 
conceivable that dE/dpi(t ) (assuming the derivative exists) can be so 
large that the smallest 8pi(t ) that can be used by the computer would 
result in a much too large change in E. There is also the possibility of 
local minima of E (with E > 0). These problems (which may not even 
occur; we are trying to anticipate the worst) could be presumably re- 
solved by changing the metric defining E and by trying a wide range of 
Pi (to).* It may be noted that convergence proofs do not seem to be avail- 



* More efficient methods of adjusting the p<(to) may be possible. See, for ex- 
ample, Wilde, D. J., Optimum Seeking Methods, Prentice-Hall, 1962. 
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able for competitive algorithms (e.g., steepest descent) for these op- 
timization problems. 

VI. CONCLUSION 

The maximum principle has been used here to attack two signal-noise 
ratio maximization problems. The first one (matched filter problem) 
could have been solved without the maximum principle. However, the 
maximum principle provided a very neat and systematic approach. The 
second problem (with the amplitude constraint included) appears to be 
unsolvable except by the maximum principle.* In this paper, it was as- 
sumed that the noise is white. The handling of non-white noise merits 
further attention, in particular, the conversion to differential equations 
and the presence of impulses (see Ref. 5, Appendix 2). It should also be 
noted that the maximum principle is not conceptually limited to time- 
invariant problems. 
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* Another possible approach to the problem would be to assume that s(t) = 
Smaxsin [e(t)] and use classical methods to solve for 6(t) (since the problem of 
inequality constraint is thereby avoided). However, the maximum principle 
attacks the problem directly. 



SIGNAL-NOISE RATIO MAXIMIZATION 489 



RECENT RELATED REFERENCES* 

1. Schweppc, F. C. and Gray, D., Radar Amplitude Modulation Using Pontryagin's 

Maximum Principle, Lincoln Laboratory preprint, 1965. 

2. Athans, M. and Schweppe, F. C, On the Design of Optimal Modulation Schemes 

Via Control Theoretic Concepts. I: Formulation, Lincoln Laboratory pre- 
print, Aug. 31, 1965. 

* I wish to thank Prof. M. Athans of MIT for sending these references. 



